# OCR-free document understanding
Donut Base Encoder
MIT
Donut is an OCR-free document understanding Transformer model that directly processes document images through a visual encoder.
Text Recognition
Transformers

eljandoubi
45
0
OCR DocVQA Donut
MIT
Donut is an OCR-free document understanding Transformer model that combines a visual encoder and a text decoder for document visual question answering tasks.
Image-to-Text
Transformers

jinhybr
240
13
OCR Donut CORD
MIT
Donut is an OCR-free document understanding model based on a Swin Transformer visual encoder and a BART text decoder. This version is fine-tuned on the CORD receipt dataset.
Image-to-Text
Transformers

jinhybr
1,130
206